On the Sum of the Heights of Sturmian Factors 



Kevin O 'Bryant 
February 2, 2008 

Abstract 

A binary word is a map : N — ^ {0, 1}, and the set of factors of W 
with length n is F„{W) := {{W(i),wli + 1), . . . ,W{i + n - 1)) : i > 0}. 
A word is Sturmian if = n + 1 for every n > 1. We show that 

the sum of the heights (also known as hamming weights) of the n + 1 
factors with length n of a binary Sturmian word has the same parity as 
n, independent of W. 

Many facts are known about the factors of length n of a Sturmian word W. 
Among the many noteworthy results are: that Fn(W) is closed under reversals 
(the map that takes (wi, . . . , it;„) to (w„,...,wi)) [H Prop 2.1.19]; that the 
volume of the convex hull of Fn{W) is 1/nl, independent of [3l Thm 1.1]; 
and that as W varies over all Sturmian words, Fn{W) takes on '^(*) values. 

We direct the reader to either [4, Chap 9] or 2, Chap 2] for an introduction to 
Sturmian words. To these we add 

Theorem 1. For every binary Sturmian word W and every positive integer n, 

h[w) = n (mod 2), 

where h{w) is the number of components of w that are '1'. 

A natural approach to proving this is to observe that since Fn{W) is closed 
under reversal, we can pair off non-palindrome factors that have the same height 
h{w), and therefore it suffices to consider only the palindromes in Fn{W). More- 
over, if a palindrome has even length, then it must have even height, and so the 
'even-n case' of our theorem does follow easily from the 'closure under reversal' 
property. When n is odd, the situation is more complicated as a palindrome 
may have even or odd weight, and there are always two [Ij: 

{(l,0,l,0,l,0,l),(l,l,0,l,0,l,l)}ci^7(ci/^), 

where c^/^ is a particular Sturmian word defined below. Our proof does not 
follow this line, and does not make use of closure under reversal. 

This result (and other computations) suggests that the eigenvalues of the 
Gram matrix Gain) := (m; • Wj)i<,j<n+i, where F„(VF) = {wi, . . . , w„+i}, 




Figure 1: The multiplicity of 1 as an eigenvalue of G2^(^^_i^{n) 



may have structure. Note that the eigenvalues of a Gram matrix do not depend 
on the ordering of the vectors, and are necessarily nonnegative real numbers. 
A particularly striking phenomenon is the following. Set m{n) to be the mul- 
tiplicity of 1 as an eigenvalue of G2^(^^_i^{n). For example m(55) = 13 and 
m(65) = 0. Figure [1] shows an impressive amount of structure, but this author 
has no explanation for why any structure would exist as n changes. Similar 
pictures result from considering other irrationals. 

1 The route of the proof 

Let [x\ denote the floor of x, and {x} the fractional part of x, i.e., x = [a:J -|-{a;}. 
Define Ba{k) := I < q < k, {qa} < {ka}}, which counts the number of 

integers in [1, k) that are 'better' denominators for approximating a from below. 

Our proof proceeds by connecting the sum in Theorem[T]to Ba{n) (for some 
a), finding a recurrence satisfied by Ba(n), and then reducing that recurrence 
modulo 2. 

The characteristic word with slope a is defined by 

Ca{n) := l{n + 2)a\ - [{n + l)a\ . 

If a is irrational, then Cq, is a Sturmian word [2, Thm 2.1.13]. It is known 
[2l Thm 2.1.3, Prop 2.1.18] that for every binary Sturmian word W and natural 
number n, there is an a e (0, 1) with F„(VF) = F„(cq), and so it suffices for our 
purposes to consider characteristic words, and to write F„(a) := F„(cq,). 

Lemma 1. h{w) — Ba{n) + {n + 1) [naj + 1. 

Lemma 2. Let a £ (0,1/2) be irrational. Then Ba{k) + i?i_Q,(fc) — k — 1. 
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Moreover, Ba{l) = 0, Ba(2) = 1, and fork>3 

{1 - fc, {ka} e [0, a); 
k-1, {fca}e[a,2a); 
0, {fca}e[2a, 1). 

Lemma 3. Let a G (0, 1) be irrational, and k any positive integer. If k is odd, 
then Ba{k) is even. If k is even, then Ba{k) = [fcaj + 1 (mod 2). 

2 Proofs 

Proof of LemmaUi We begin by following [H Lem 10.5.1]; define tTj by 
{0 = TTo < TTi < 7r2 < • ■ • < 7r„} = {0, {-a}, {-2a}, {-na}}. 
Set Vi{x) := [{i + l)a + x\ — [ia + x\ , and set 

:= {vo{TTi),Vl{TTi),V2{'!rt), . . . ,Vn-l{TTi)). 

Nontrivially (see H]), Fn(a) = {m; : < z < n}, and wq, wi, . . . ,Wn are ordered 
lexicographically. Elementary examination yields hiwi) — \LC\ {TTi,na + 7ri]|, 
and this last quantity is either \na\ or [naj + 1. We start with h{wo) = lna\, 
and the first i for which h{wi) = \na\ + 1 is the i for which na + iTi ^ Z, that 
is, when {—na} = tt^. In other words, the last Ba{n) + 1 factors have weight 
\na\ + 1 and the first n + 1 — {Ba{n) + 1) factors have weight [na\ . This gives 

J2 Kw) = {B^{n) + l){[na\ + 1) + (n + 1 - (B„(n) + 1)) \na\ 

= l + Ba{n) + (n+ 1) \na\ . 

□ 

Our proof of Lemma [2] is similar in spirit to, and was directly inspired by, 
Sos's proof of the Three-Gap Theorem [5]. 

Proof of Lemma[M Observe that < {qa} < {ka} iff {fc(l-a)} < {q{l-a)} < 
1, so that q with 1 < g < fc is in either the set {g : 1 < g < fc, {qa} < {ka}} or 
in the set {g : 1 < g < fc, {g(l — a)} < {fc(l — a)}}, and is not in both (as a is 
irrational, {fc(l - a)} ^ {g(l - a)}). Thus, B„(fc) + Bi_„(fc) = fc - 1. 

We think of the fc + 2 numbers 0, {a}, . . . , {ka}, 1 as lying on a unit circle, 
and labele d Pp , Pi, ... , Pk, Po, respectively, i.e., Pj := e^'^J^v^ = g2^ba}y=T^ 
"The arc PiPj" refers to the half-open counterclockwise arc from Pi to Pj, 
containing Pi but not Pj. We say that three distinct points A, B, C are in order 
if B ^ CA. We say that A, B, C, D are in order if both A, B, C and C, D, A 
are in order. Essentially, if when moving counter-clockwise around the circle 
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starting from A, we encounter first the point B, then C, then D, and finally A 
(again), then A, B, C, D are in order. 

By rotating the circle through an angle of 27Ta, so that Pi > Pi+i {0 < i < 
k), we find that each P on the arc Pk-2Pk-i is rotated onto a P on the arc 
Pk-iPk- Specifically, the number of Pq, ^i, ■ • • , Pk-2 on Pfe_2-Pfe-i is the same 
as the number of Pi, P2, ■ ■ • , Pk-i on Pk-iPk- Set 

X {Po,Pi,...,P;,_2} and y := {Pi, P2, . . . , Pfe_i}, 

so that what we have observed is 



\x n Pk-2Pk-i\ = \Y n Pk-iPk\ ■ (1) 



Also, we will use the definition of Ba in the forms Ba{k) — \Y r\ PoPk\ and 
Ba{k — 1) = \X n PoPfc_i I — 1, and with k and k—1 replaced by fc — 1 and k — 2, 
when circumstances allow. 

Now, first, suppose that {ka} e [0, a), so that the points Pq, Pk, Pk-2, Pk-i 
are in order on the circle. We have 

X n Pk-2Pk-i =xn (PoPfe-i \ PoPfc_2) 



= {xn PoPfc_i) \ [x n PoPfc-2j 



\xnPk-2Pk-i\ = \{xnPoPk-i) \ - \{xnl^K^\ 

= (B„(fc-l)-l)-(P„(fc-2)-l) 
= Ba{k-l)-Bc,{k-2), 



and similarly 



Y n Pfc_iPfc = (r n Pfc-iPo) u (r n PoP^) 

= (r\(ynivv7)) u(rnTO) 



\YnPk-iPk\ = {\Y\ - |rnPoPfc_i|) + \YnPoPk\ 

= (fc-f-P„(fc-l))+Pa(fc) 

so that Eq. (P) becomes B^ik - 1) - Ba{k - 2) = Ba{k) - Ba{k - f ) + fc - 1, 
as claimed in the statement of this lemma. 

Now suppose that {fca} G [a, 2a), so that the points Pq, Pk-i, Pk, Pk-2 are 
in order. By arguing as in the above case, we find 

X n Pk-2Pk-i ^{x\{xn PoPk-2)) u (X n PoPfc-i) , 



and so \X n Pk-2Pk-i \ =k-l - {B{k - 2) - 1) + (P(fc - 1) - 1). Likewise, 



Y n Pfe-iPfe = (r n PoPfe) \ {y n PoP^-i) 



so that \Y n Pfc-iPfe] = Ba{k) — Ba{k — 1). Thus, in this case Eq. ([T]) becomes 
Ba{k — 1) — Ba{k — 2) + fc — f = Ba{k) — Ba{k — 1), as claimed in the statement 
of the lemma. 
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Finally, suppose that {ka} G [2a, 1), so that the points Pq, Pk-2, Pk-i, Pk 
are in order. We find 



X n Pk-2Pk-i ^{xn PoPk-i) \ (x n PaPk-2 



and so \X n Pk-2Pk-i \ = B{k - 1) - B{k - 2). Also, 

YnPk-iPk = (rnTp^) \ (rnTpvT) 



and so |y n P/c-iPfel = Ba{k) — Ba{k — 1). As claimed, Eq. ([T]) becomes i?Q (/c — 
l)-B„(fc-2) = B„(fc)-B„(fc-l). □ 



For the remaining proofs, we write [Q] 



1, Q is true; 
0, Q is false. 



Proof of Lemma\^ Reducing Lemma [2] modulo 2, we find that if < a < 1/2, 
then 

B„(fc) = B„(fc - 2) + {k even] [{ka} < 2a]. 
and if 1/2 < a < 1, then 

Ba{k) ^ -Bi_„(fc) + fc - 1 = Bi-a(fc) + [fc even! (mod 2) 

We work in four cases: k may be odd or even, and a may be less than or greater 
than 1/2. 

Assume first that k is odd and < a < 1/2. As Ba{l) — and Ba{k) = 
Ba{k — 2) (mod 2), we see by induction that Ba{k) is even. 

Now assume that k is odd and 1/2 < a < 1. We have Ba{k) = i?i_Q,(fc) 
(mod 2) , and asO< 1 — Q!< 1/2, the paragraph immediately above implies that 
Bi-a{k) is even. 

Now assume that k is even and < a < ^. Set [3 — 2a, k — 2i and 
B'{i) = B^{2i), so that B'{1) = 1 and B'{i) = B'{i - 1) + |{i/3} < /?] (mod 2). 
We have 

P„(fc) = Bo,{2l) = B'(^) = B'{1 - 1) + < /?| (mod 2) 

= i3'(l)+^I{z/3}</3| (mod 2) 

1=2 

= B'(l) + = 1 + L(fc/2)(2a)J = 1 + [fcaj , 

since X]i=2l{*/^} ^ f^l' '^^^^^ ^ C^' -'^)' counts the integers in the interval (/3, £(3]. 

Finally, suppose that k is even and 1/2 < a < 1. By the paragraph imme- 
diately above, Bi-a{k) = 1 + [k{l — a)\ (mod 2). We have 

Ba (fc) = 1 + (fc) (mod 2) 
= [fc(l - q)J (mod 2) 
= [fcaj + 1 (mod 2), 

where we have again used the irrationality of a in the last line. □ 
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Proof of Theorem\^ Now, if n is odd, then by Lemma [31 Ba{n) = (mod 2) 
and obviously n+l = (mod 2), whence 1 + Ba{n) + (n + l) [na\ = 1 (mod 2). 
If n is even, then by Lemma El Ba{n) = [na\ + 1, whence 1 + Ba{n) + (n + 
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